Model Selection

Synthetic data training

# Synthetic data training

Phi 4 Mini Reasoning GGUF

Phi-4-mini-reasoning is a lightweight open model built on synthetic data, focusing on high-quality, reasoning-rich data, and further fine-tuned for more advanced mathematical reasoning capabilities.

Large Language Model

Smartshot Zeroshot Finetuned V0.1.2

A zero-shot classification model fine-tuned based on roberta-base-zeroshot-v2.0-c, enhanced with SmartShot method and synthetic data

Text Classification Other

Smolvlm 500M Anime Caption V0.1

A vision-language model specialized in describing anime-style images, fine-tuned from SmolVLM-500M-Base, trained on 180K synthetic image/caption pairs generated by large language models.

Image-to-Text English

Gliner Biomed Base V1.0

GLiNER-Biomedical Edition is a specialized biomedical named entity recognition model developed based on the GLiNER framework, capable of identifying multiple biomedical entity types.

Sequence Labeling

PyTorch English

Gec Spanish BARTO SYNTHETIC

A Spanish grammar correction model based on the BART architecture, trained on the COWS-L2H dataset and 80,984 synthetic data entries, optimized for single-sentence correction

Text Generation

Transformers Supports Multiple Languages

EVA Qwen2.5 72B V0.2

A large language model fine-tuned based on Qwen2.5-72B, specializing in text generation and instruction-following tasks

Large Language Model

Depth Anything V2 Metric Outdoor Large Hf

A fine-tuned version of Depth Anything V2 for outdoor metric depth estimation tasks, trained on the synthetic dataset Virtual KITTI

Gliclass Large V1.0

An efficient zero-shot classifier trained on synthetic data, suitable for topic classification, sentiment analysis, and reranking tasks in RAG workflows.

Text Classification

Transformers English

Gliclass Base V1.0

GLiClass is an efficient zero-shot classifier inspired by GLiNER, suitable for text classification, sentiment analysis, and reranking tasks in RAG workflows.

Text Classification

Transformers English

Gliclass Base V1.0 Lw

GLiClass is an efficient zero-shot classifier trained on synthetic data, suitable for text classification, sentiment analysis, and reranking tasks in RAG workflows.

Text Classification

Transformers English

Llama 3 Instruct 8B SPPO Iter3

A large language model developed in the third iteration using the Self-Play Preference Optimization method based on the Meta-Llama-3-8B-Instruct architecture.

Large Language Model

Transformers English

Gemma is an advanced open-source model trained on high-quality datasets, supporting different context length requirements.

Large Language Model

Gliclass Large V1.0 Init

GLiClass is an efficient zero-shot classifier trained on synthetic data, suitable for topic classification, sentiment analysis, and reranking tasks in RAG workflows.

Text Classification

Transformers English

T5 Base Spell Correction Fr

This model is based on the T5 architecture, specifically designed to correct spelling and punctuation errors in French text.

Text Generation

Transformers French

Bert Base Cased NER Reranker

BERT-based Named Entity Recognition (NER) context reranking model for evaluating the helpfulness of contextual sentences for NER predictions

Sequence Labeling

Transformers English

Dhenu Vision Lora 0.1

An agricultural disease detection model fine-tuned based on Qwen-VL-chat, specializing in disease identification and treatment recommendations for three major crops: rice, corn, and wheat.

Transformers English

A Russian and English spelling correction model based on the mT5-large architecture, normalizing words to correct spelling and typographical errors.

Large Language Model

Transformers Supports Multiple Languages

Nous Hermes 2 Mistral 7B DPO AWQ

Nous Hermes 2 is a next-generation flagship 7B Hermes model based on Mistral 7B DPO, optimized with DPO and demonstrating excellent performance across multiple benchmarks.

Large Language Model

Transformers English

Openmath CodeLlama 7b Python Hf

The OpenMath model is specifically designed for solving mathematical problems by integrating textual reasoning with Python interpreter-executed code blocks. Trained on the OpenMathInstruct-1 dataset containing 1.8 million math problem-solution pairs.

Large Language Model

Transformers Supports Multiple Languages

A 7B-parameter causal language model compatible with Meta LLaMA 2 architecture, outperforming similar models under 33B in multiple evaluations

Large Language Model

Transformers Supports Multiple Languages

Phi Hermes 1.3B

Phi-1.5 model fine-tuned on the Hermes dataset, primarily used for text generation tasks

Large Language Model

Transformers English

Esm2 T6 8M UR50D Sequence Classifier V1

A small sequence classifier trained based on the ESM-2 protein language model, capable of classifying protein sequences into three categories: enzymes, receptor proteins, and structural proteins.

Transformers English

AmelieSchreiber

This is a 13-billion-parameter language model based on the LlaMa architecture, fine-tuned using synthetic data, primarily for research purposes.

Large Language Model

An OCR system based on Transformer architecture, specifically designed for recognizing Central Kurdish text, trained using synthetic data.

Text Recognition

Trocr Base Printed Synthetic Dataset Ocr

A fine-tuned printed text recognition model based on microsoft/trocr-base-printed, optimized for synthetic OCR datasets

Text Recognition

Transformers English

Paraphraser Bart Large

An automatic paraphrase model based on BART-large architecture, trained on the ParaBank 2 dataset, capable of generating high-quality English sentence paraphrases

Text Generation

T5 Base Multi Sentence Doctor

A T5-based model for correcting sentence errors in English, German, and French texts

Large Language Model

Featured Recommended AI Models

AIbase

Empowering the Future, Your AI Solution Knowledge Base

English 简体中文繁體中文にほんご

© 2025AIbase